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Abstract: A novel human coronavirus (CoV) has been identified as the etiological agent that 
caused the severe acute respiratory syndrome (SARS) outbreak in 2003. The spike (S) protein of 
this virus is a type I surface glycoprotein that mediates binding of the virus to the host receptor and 
the subsequent fusion between the viral and host membranes. Because of its critical role in viral 
entry, the S protein is an important target for the development of anti-SARS CoV therapeutics and 
prophylactics. This article reviews the structure and function of the SARS CoV S protein in the 
context of its role in virus entry. Topics that are discussed include: the interaction between the S1 
domain of the SARS spike protein and the cellular receptor, angiotensin converting enzyme 2 
(ACE2), and the structural features of the ectodomain of ACE2; the antigenic determinants 
presented by the S protein and the nature of neutralizing monoclonal antibodies that are elicited in 
vivo; the structure of the 4,3-hydrophobic heptad repeats HR1 and HR2 of the S2 domain and their 
interaction to form a six-helical bundle during the final stages of fusion. Opportunities for the 
design and development of anti-SARS agents based on the inhibition of receptor binding, the 
therapeutic uses of S-directed monoclonal antibodies and inhibitors of HR1—HR2 complex 
formation are presented. © 2006 Wiley Periodicals, Inc. Med Res Rev, 26, No. 4, 414-433, 2006 
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1. INTRODUCTION TO THE SARS Cov 


Severe acute respiratory syndrome (SARS) is an atypical pneumonia that initially emerged from 
Guangdong Province in the southern region of China around November 2002. This virus 
subsequently spread rapidly to 25 countries around the world within a very short period of time, 
infecting more than 8,000 people and causing more than 700 deaths by the time the outbreak ended in 
July 2003. '? The mortality rate from SARS is approximately 15%, which compares with less than 1% 
for influenza infection, and shows a marked dependency on age. The unanticipated outbreak caused 
enormous anxiety and uncertainty for public health management authorities based on the unknown 
cause and the paucity of effective therapeutics available. The SARS outbreak was ultimately 
contained by a concerted effort that included patient isolation, intensive control of infection in 
hospitals, traditional quarantine measures, and the issuing of a travel advisory that was enforced by 
the World Health Organization. 

The etiological agent of SARS was quickly identified as a coronavirus (CoV) in March 2003 after 
a unique, global collaborative effort between health and scientific organizations. Phylogenetic 
analysis of the amino acid sequence of this newly discovered CoV revealed it to be distinct from the 
previously characterized group 1, 2, and 3 coronaviruses, *** although extensive genome and 
proteome analyses suggested that it is distantly related to group 2 coronaviruses, for example human 
CoV OC43 and murine hepatitis virus (MHV).° Like other coronaviruses, the SARS CoV is an 
enveloped, positive-stranded RNA virus that contains a large genome, comprised of 29,740 bases that 
constitute 14 open reading frames. The two large, overlapping open reading frames, ORF 1a and 1b, 
that comprise approximately two-thirds of the genome, encode a single polyprotein containing the 
replicative enzymes, while the downstream open reading frames encode the structural proteins in the 
following order: spike (S), envelope (E), membrane (M), and nucleocapsid (N). In between these four 
structural genes are eight predicted open reading frames, referred to as accessory genes, that encode 
non-structural proteins of unknown function.*4°7 

Interferons, ribavirin, and some HIV protease inhibitors, particularly Kaletra, a combination of 
lopinavir and low-dose ritonavir, were used empirically to treat SARS-infected patients in some 
affected countries during the outbreak.* SARS CoV vaccines, which offer the promise of an effective 
means of preventing and controlling potential future outbreaks, remain under active development.” 
To date, there are no specific and efficacious antiviral drugs readily available to prevent or treat SARS. 
Although widespread SARS CoV infections have not re-emerged following the initial outbreak, 
sporadic cases were reported in 2004! and the possibility of an outbreak emerging from the natural 
reservoir remains unpredictable.” Consequently, effective antiviral agents against SARS CoV are 
desirable and, in contrast to influenza infection, viral load in the respiratory tract of a SARS Co 
V-infected individual peaks at day 10 following the onset of clinical symptoms, providing a wider 
window for treatment. 

The SARS CoV surface glycoprotein, designated the spike or S-protein, is responsible for 
attachment and entry of the virus to host cells, the critical early step in the life cycle of the virus, and 
represents an important potential target for therapeutic intervention. Similar to HIV,'° the entry of the 
SARS CoV into host cells takes between 10 and 20 min, based on observations viewed by electron 
microscopy during infection of Vero E6 cells in culture.'' Although detailed kinetic studies have not 
been reported, there continues to be tremendous interest in understanding precisely the mechanistic 
steps associated with SARS CoV entry, and several significant findings have been reported within a 
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short period of time. This article summarizes developments in this area, reviewing the structure and 
function of the SARS S protein,'”'? its antigenic determinants and the mechanism of viral entry as it 
is presently understood. We highlight the opportunities for therapeutic intervention in this process 
that may potentially provide a treatment for SARS CoV infection. '*~'® 


2. SARS CoV SPIKE PROTEIN 


As a newly discovered virus that is distinct from the three known groups of cornonavirus, the SARS 
CoV S protein shares only 20% —27% amino acid identity with the group 1, 2, and 3 cornonaviruses.° 
Figure 1 shows a schematic organization of the SARS CoV S protein, noting the approximate 
positions of the various functional domains that have been identified to date. At the N-terminal is a 
short secretory signal sequence thought to be cleaved co-translationally. The $1 domain is a globular 
structure that is responsible for binding to the host cell receptor, which has been identified as 
angiotensin converting enzyme 2 (ACE2)'” (Section 4). Sequence analysis has predicted that the S 
protein contains heptad repeat regions of the type found in several other viral fusion proteins.* 
Computer sequence predictions together with systematic amino acid mapping studies of the stalk- 
like S2 domain have located the HR1 region to amino acids 896-972, while the HR2 region extends 
from amino acids 1142 to 1188 (Fig. 1).*!8~*3 Biophysical analysis showed that the isolated HR1 
domain is highly helical and itself assembles into a thermally stable coiled coil, whereas the HR2 
domain has low helical content and is less stable.'**' The HR1 and HR2 elements associate into a 
stable, antiparallel, six-helix bundle in which the HR1 is assembled centrally as a triple-stranded 
coiled coil, and to which the HR2 elements are bound!*-7? (Section 3). This intramolecular 
interaction is common to several other viruses and is thought to be the mechanism by which viral and 
host membranes are brought into close proximity as a prelude to membrane fusion. As a consequence, 
the SARS CoV S protein is characterized as a class I integral membrane protein, sharing membership 
with other class I viral envelope glycoproteins, including the S protein of related group 2 
cornonaviruses, MHV,**> HIV gp41, influenza hemagglutinin HA2,”° the respiratory syncytial 
virus (RSV) F protein,”’ and the simian parainfluenza virus 5 (SV5) F protein.”® The S2 domain of the 
SARS CoV is more conserved than the S1 domain, with the HR1 and HR? regions exhibiting higher 
identity with other cornonaviruses, 50% and 32%, respectively.'® Interestingly, circular dichroism 
(CD) analysis and proteinase K digestion studies have suggested that the HR2 domain binds to HR1 
with a combination of helical and extended conformations in the HR1—HR2 core complex.'*7° Such 
a structural feature, which resembles those of MHV~ and SV5,”* was subsequently confirmed by 
X-ray crystallographic studies (Section 3). The HR1 and HR2 regions are separated by a 170 amino 
acid interdomain loop that is similar in length to that of the corresponding region in the SVS F protein. 
This relatively long interdomain loop is thought to counterbalance the short region of the S protein 
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Figure 1. Schematic organization of SARS CoV S protein, showing the S1 and the S2 domains, and the approximate amino acid 
positions of the various functional domains (the size of each domain is not drawn to exact scale)/8-22806162 A: A trytophan and 
tyrosine rich aromatic region.22°7 C: A cysteine rich region. [Color figure can be viewed in the online issue, which is available at 
www.interscience.wiley.com] 
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that is proximal to the viral membrane, providing the flexibility required for the conformational 
change that occurs during the membrane fusion process.”! This region is also noted for its high 
hydrophobicity and high activity in a membrane leakage model, which measures the activity of S 
protein-derived synthetic peptides to rupture phospholipid vesicles encapsulating a fluorescence 
probe molecule, suggesting an important role in facilitating the fusion process.”? 

Type I surface glycoproteins are typically processed proteolytically into individual subunits 
comprised of a membrane-bound fusion protein and a receptor-binding subunit that are associated 
non-covalently or through the agency of disulfide bonds and hydrogen-bonding interactions. 
Proteolytic cleavage usually occurs proximal to the fusion peptide, unmasking but not exposing this 
critical mediator of the fusion process. However, proteolytic cleavage of the SARS CoV S protein into 
separate S1 and S2 domains has not been observed, consistent with the fact that it does not contain the 
basic amino acid cleavage site (e.g., RRXRR) that is found in group 2 and 3 cornonaviruses.° 
Consequently, the boundary of the S1 and S2 domains has not been precisely located but it is 
predicted to be in the vicinity of amino acid 758 based on sequence analogy to MHV.*° Cleavage of 
the S protein is not a necessary prerequisite for viral entry,?°*°' although cleavage of the S protein 
appears to enhance cell-cell fusion, as observed between 293T cells expressing the SARS CoV S 
protein and Vero E6 cells.*” In the absence of proteolytic cleavage of the S protein, SARS CoV—host 
membrane fusion will depend on the activity of an internal fusion peptide that has been predicted to 
reside between amino acids 851 and 890,7°7! a region rich in alanine and glycine residues located 
near the N-terminal of HR1. However, results from membrane partitioning and leakage studies using 
S protein-derived synthetic peptides suggest that this fusion peptide may reside further towards the 
C-terminal, starting from amino acid 770.°> Possible proteolytic processing of the S protein was 
proposed based on the detection of S2 fragments in the cell lysates of SARS CoV-infected Vero E6 
cells by S2-specific monoclonal antibodies.**~-*° However, it should be noted that isolation of the 
corresponding S1 fragments were not reported. 

A small stretch of approximately 12 amino acids rich in tyrosine and trytophan residues has been 
identified in between HR2 and the transmembrane anchor. This highly hydrophobic, aromatic region 
shows a high propensity to partition into a lipid bilayer in a membrane leakage model system.7”>” 
This region, together with the interdomain loop mentioned above, may help to destabilize lipid 
bilayers, providing an additional impetus for fusion between the viral and host membranes. 
Immediately following the transmembrane domain is a short, cysteine-rich region within the 
cytoplasmic domain that is conserved among coronaviruses"! and is required for fusion activity.°* 
In its native form, the full length S protein exists as a stable and non-covalently complexed 
homotrimer with C-terminal deletion studies suggesting that the transmembrane and/or cytoplasmic 
domains are important for trimerization.*!*? 

Enveloped viruses expressing surface glycoproteins enter host cells by two well-established 
pathways. One is through receptor binding-induced conformational changes of the virus 
glycoproteins, which leads to direct fusion of the viral and host membranes at the cell surface. 
This process is used by HIV and RSV. The second pathway is one of receptor-mediated endocytosis in 
which the virion is taken into the cell in an endosomal compartment. As the pH of the endosome falls 
during transit to the lysozome, the acidic environment triggers the conformational changes of the 
virus glycoproteins that are required for membrane fusion. This paradigm is used most prominently 
by influenza virus. Whether the entry of SARS CoV is pH dependent or pH-independent remains to be 
established, with current experimental evidence not definitive. The entry of the SARS CoV S protein- 
pseudotyped virus on a HIV vector into Vero E6 cells, 293T cells, or Huh7 cells was inhibited by high 
(millimolar) concentration of NH,Cl, suggesting a pH-dependent endocytotic process.*?4°-* 
However, these results were contradicted by another study‘? using lower, 1M concentrations of 
NH, Cl and the observations from an assay in which fusion between cells separately expressing the 
SARS CoV S protein and the ACE? receptor occurred efficiently at neutral pH” Moreover, entry of 
the SARS CoV into Vero E6 cells via membrane fusion at the cell surface rather than an endocytotic 
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pathway was observed by electron microscopy.** It should also be noted that CD-analysis showed 
that the helicity of a HR1 peptide (amino acids 882—973) did not increase significantly at pH 4.5 when 
compared to neutral pH? suggesting that low pH did not induce a significant conformational change. 
Very recent studies on the entry of Ebola virus into host cells, a process mediated by the disulfide- 
linked surface glycoprotein GP1-GP2, using a pseudotyped system to infect Vero cells, have revealed 
a third process for triggering virus and host membrane fusion.*° For Ebola virus entry, this process 
appears to be dependent on an initial proteolytic cleavage of GP1 by the endosomal cysteine 
proteases, capthepsin B, or L followed by a second proteolytic step mediated by capthepsin B.* The 
pH studies on SARS CoV entry as described above do not exclude such a mechanism. It is also 
intriguing, as noted, that the detection of S2 fragments in the cell lysates of SARS CoV-infected Vero 
E6 cells were reported. 


3. THE $2 DOMAIN AND 4,3-HYDROPHOBIC HEPTAD 
REPEATS (HR1, HR2) OF THE SARS CoV S PROTEIN 


Very recently, X-ray crystallographic structures of a complex of HRI and HR2 that represent the 
fusion active form of the S protein of the SARS CoV have been solved at 1.6,*° go * and 2.8 A*® 
resolution. These structures validated conclusions drawn earlier from biophysical studies. As 
depicted in Figure 2, the three HR1 units (amino acids 890-973) associate into a central, parallel 
trimeric coiled coil structure to which the three HR2 elements (amino acids 1147-1188) bind into 
grooves in the HR1 complex in an oblique, antiparallel direction.*” The HR2 units each consist of a 
short five-turn o-helix (amino acids 1161-1177), which contrasts with the longer nine-turn helix 
observed in the HR2 elements of both HIV gp41°° and the RSV F protein.’ Ateach end of the N- and 
C-terminus of the HR2 helix, the peptide chains exist in an extended conformation. As revealed by 
another structure of the HRI—HR2 complex (PDB 2BEZ),*° the extended HR2 region at the N- 
terminal spans five additional residues from Asn1142 to Thr1 160, which complements residues 962— 
929 of the HR1 coiled coil. The interaction between HR1 and HR2 is largely hydrophobic, with the 
HR2 helical region binding into deep grooves in the complex while the extended regions bind in 
shallower grooves.** Both of the extended regions of HR2 consolidate their interaction with HR1 by 
several B-sheet-like hydrogen bonds formed between the primary amide groups of the side chains of 
Asp and Gin of HR1, and the main chain carbonyls and amide NHs of HR2.*’ The N-terminal residues 
of HR1, amino acids 890-892 in the complex shown in Figure 2, are in an extended conformation, 
while residues 885—889 of the HR1 construct used in these crystallographic studies are disordered, 
suggesting that this region, which connects to the fusion peptide, is flexible to allow the projection of 
the fusion peptide into the host membrane. 

Using a strategy similar to that utilized in the discovery of inhibitors of HIV gp41-mediated 
fusion that ultimately produced the drug enfuvirtide (T-20), peptides derived from the heptad repeats 
HR1 and HR2 of the S2 domain of the SARS CoV virus were evaluated for their capacity to prevent 
infection of Vero cells. In this cell-based assay, the C-terminal HR2-derived peptides were inhibitory 
but peptides derived from the N-terminal HR1 were not, results consistent with observations from the 
related MHV CoV area.”* A 68-mer peptide HR2-8 (amino acids 1126-1193) showed inhibitory 
activity, with an ECs of 17 1M.?° Similarly, a shorter HR2-derived peptide, CP-1 (amino acids 
1153-1189) exhibited an ECsp of 19 uM.!° A sequence derived from HR2-18 (amino acids 1161— 
1187) displayed an ECs of 5.2 uM.“ These HR2 peptides, which contain the key HR1-interacting 
region defined by amino acids 1142-1185, are thought to inhibit fusion by interfering with the 
formation of the fusion-active six-helix bundle, binding to HRI during the pre-hairpin state. 
Inhibition of this type is sequence specific, since a peptide derived from the MHV HR2 region, 
designated MHV-HR2, was inactive against SARS CoV fusion.”° However, the inhibitory potency of 
these SARS CoV HR2 peptides is considerably lower than the sub-micomolar activity measured 
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Figure 2. Crystal structure of the SARS CoV S2 HRI-HR2 core complex (HRI (red): amino acids 890-973; HR2 (blue) amino acids 
1147-1188). HR1-N: HR1 N-terminus; HR2-C: HR2 C-terminus. The structure is adapted from PDB 1WYY~* and by using WebLab 
ViewerPro 5.0 (Accelrys). [Color figure can be viewed in the online issue, which is available at www.interscience.wiley.com] 


for the corresponding inhibition of MHV fusion by the MHV-HR2 peptide, which exhibits an ECs, of 
0.9 uM.?° This is attributed, in part, to a weaker interaction within the SARS CoV six-helix bundle, as 
indicated by the lower thermal stability when compared to that of MHV. A comparison of the SARS 
CoV six-helix bundle structure with that of MHV”° revealed the structural basis for the weaker 
interaction.*° First, a larger void volume exists inside the SARS CoV coiled coil due to the presence 
of the smaller side chain of Leu at position 920 of HR1, instead of the larger side chain of 
the conserved Phe at this position (SARS CoV numbering) among other cornonaviruses. Second, the 
SARS CoV HRI! has shallower hydrophobic pockets that interacts with smaller side chains of HR2 
residues in a few positions, when compared to MHV (e.g., Ala in SARS CoV instead of Val as in 
MHYV). Third, the buried surface area of the HR1-HR2 groove interface of SARS CoV is 10% less 
than in the MHV complex. Nevertheless, it should be noted that a peptide derived from HR2 of the 
SARS CoV, HR2-38, (amino acids 1149-1186) was shown to have ECs, below 5 nM in an infection 
assay using Vero E6 cells.”* This HR2 peptide is almost identical to the one shown in Figure 2 and the 
potency suggests that this peptide represents the optimal interaction with HR1 domain. Another 
contradictory result, that the HR1-derived peptide HR1-1 (amino acids 889-926) exhibited 
inhibitory activity with an ECso of 3.7 1M, has also been reported.*” These discrepancies may be due 
to physicochemical properties of peptides used and different assay conditions, including longer 
incubation times and different read-out methods. 
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In addition to the peptide-based inhibitors, small molecule inhibitors have been identified using 
an assay that measures the binding of compounds to a S2 protein (amino acids 733-1190) expressed 
in E. coli. Tetra-O-galloyl-B-p-glucose and luteolin were found to be inhibitors in the binding assay, 
with micromolar ECs0s, and shown to prevent infection of Vero E6 cell by both pseudotype and wild- 
type SARS CoV. An examination of the six-helix bundle crystal structure identified a hydrophobic 
pocket located in HR1 as a potential target for the design of small molecule inhibitors.*° This pocket 
is occupied by the side chains of Leu1148 and Val1146 from the extended conformation region of 
HR2 in the six-helix bundle. The deeper grooves into which the HR2 helical regions bind could also 
be considered as a potentially viable target for small molecule inhibitors.** The feasibility of this 
approach has been demonstrated recently with the discovery of small molecules that target a similar 
hydrophobic pocket in HR1 of the RSV F protein.°’ BMS-433771 is a potent, orally bioavailable 
RSV fusion inhibitor that exhibits antiviral efficacy in murine model of RSV infection.*”* 


4. THE CELLULAR RECEPTOR FOR THE SARS CoV: ANGIOTENSIN 
CONVERTING ENZYME 2 (ACE2) 


The ACE2 protein has been identified as the host receptor for the SARS CoV, recognizing the S 
protein and mediating virus binding to the host cell and subsequent entry. nae Interestingly, ACE2 
itself was discovered only recently (in 2000) using a genomic-based approach and characterized as an 
enzyme homologous to ACE.*°*’ Although both enzymes are zinc metalloproteases that function as 
carboxypeptidases, ACE is a peptidyl dipeptidase while ACE2 is a type I integral membrane protein 
that cleaves only one amino acid from the C-terminal of substrates. ACE2 is mainly expressed in the 
heart, kidney, and testes and, like ACE, appears to be an important enzyme in the renin-angiotensin 
cascade. While ACE hydrolyses the decapeptide angiotensin I (AI) to the vasoconstrictor AII, ACE2 
acts in conjunction with ACE to convert AI to the heptapeptide A1-7, which is a vasodilator. While it 
has been established that ACE2 plays an essential role in the regulation of cardiac function,>® other 
physiological functions of this enzyme remain to be discovered. More recently, ACE2 was found to 
be highly expressed on lung alveolar epithelial cells and enterocytes of the small intestine,” 
locations consistent with the pathogenesis of SARS CoV infection. Although SARS appears mainly 
as arespiratory disease, the SARS CoV was shown to infect the human liver cell line Huh7,*"? which 
expresses high levels of ACE2. 

In the absence of an X-ray crystal structure of the SARS CoV S protein bound to ACE2, 
biochemical studies are being used to discern the nature of the interaction between these proteins. In 
the process of identifying ACE2 as the SARS CoV receptor, the binding of the $1 domain to the 
receptor was clearly demonstrated, with amino acids 318-510 of S1 shown to be a critical 
recognition element.*°°*! Point mutation studies indicated that Glu452 and Asp454 in this region 
are responsible for key interactions with the ACE2 receptor, while five Cys residues, 348, 366, 419, 
467, 474, are also essential elements in the association with ACE2.°! A soluble form of the $1 domain 
associates into a dimer which binds to ACE2 more avidly than the monomer. The S1 dimerization 
domain has been mapped to amino acids 17—217, a region closer to the amino terminus of S1 that is 
also essential for membrane fusion.™ 

The peptidase activity of ACE2 was shown to make no contribution to the S protein-mediated 
entry of the SARS CoV. An ACE? variant, in which the active site histidine residues 374 and 378 were 
modified to asparagines, proved to be equally effective as a receptor for virus entry as the wild-type 
ACE2.™ Moreover, truncation of the cytoplasmic domain of ACE2 from 42 to 9 amino acids did not 
affect the efficiency of SARS CoV infection in a pseudotype assay, suggesting that this domain was 
not essential for ACE2-mediated entry of the virus.®° A soluble form of ACE2 conjugated to an 
human IgG1 fragment was shown to bind with high affinity to S1, Kd = 1.7 nM, as measured by 
surface plasmon resonance (Biacore) methodology. More recently, crystallographic structures of 
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the ectodomain of ACE2 in native form and bound to the active site-directed inhibitor MLN-4760 
have been reported.°” As shown in Figure 3, the zinc-containing active site of ACE2 is located deep 
inside and the near the bottom of a long canyon formed between the N-terminal subdomain I and the 
C-terminal subdomain II. A notable structural difference between the native and inhibitor-bound 
forms is that there is an inhibitor-induced repositioning of the two subdomains by 16° relative to each 
other, such that the open cleft closes to wrap around the inhibitor. The outer edge region of one 
subdomian moved as much as 13 A. These crystal structure data, together with the mapping studies 
that have identified the location of the receptor-binding domain on S1 of the CoV S protein, provide a 
basis for molecular modeling of the binding interaction between the two proteins. Although the 
sequence similarity between the SARS CoV S protein and the S protein of other known coronaviruses 
is very low and X-ray structures of these proteins are not yet known, attempts have been made to use 
predictions of secondary structure and fold recognition approaches to construct a model for the SARS 
CoV S protein.©*-7° Early modeling studies based on a homology model of human ACE2 and a 


Figure 3. (a) Crystal structures of the ectodomain of human ACE2 in its native form, and (b) with inhibitor MLN-4760 (blue ball and 
stick) boundinthe active site. The active site Zn2* was shown as an enlarged yellow ball for clarity. The subdomain llin Figures (a) and 
(b) was drawn to the same orientation such that the relative conformational change of subdomain | is obvious. The structures are 
adapted from PDB 1R42 and 1RA4L, respectively®” and by using WebLab ViewerPro 5.0 (Accelrys); (c) Structure of MLN-4760 andiits 
ACE2 inhibitory potency. [Color figure can be viewed in the online issue, which is available at www.interscience.wiley.com] 
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Figure 3. (continued) 


threading model of the known receptor-binding domain of $1 suggested that negatively charged 
ridges surrounding the ACE2 catalytic site are complementary to the largely positively charged 
surface of the receptor-binding domain of the S protein. 

Inhibition of the attachment of the SARS CoV S protein to ACE2 is a strategy with potential to 
identify inhibitors for the treatment of SARS infection. A peptide comprised of amino acids 318-510 
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of the S protein conjugated to the Fc domain of human IgG! (RBD-Fc), inhibited S protein-mediated 
viral entry with an ECs) of < 10 nM ina pseudotype assay. This hybrid peptide was more potent than 
the corresponding conjugate of the full length S1, which showed an ECs9 of 50 nM.°' The difference 
in activity may due to a difference in physicochemical properties, for example, solubility or the 
possibility that the receptor-binding domain in the full length S1 is obscured and thus less accessible. 
A soluble form of ACE2 was shown to block the association of the S1 domain with Vero E6 
cells and an anti-ACE2 antibody inhibited SARS CoV replication in Vero E6 cells with an ECs of 
1.5 ug/mL."” The soluble ectodomain of ACE2 was shown to inhibit S protein-mediated viral entry in 
a pseudotype assay, °° while the catalytically inactive form of ACE2 conjugated to the Fc domain of 
human IgG1 potently inhibited SARS CoV infection in Vero cells, with an ECs of 2 nM. Similar 
receptor-based approaches using the HIV cellular receptor CD4 are being developed as anti-HIV 
therapies. PRO-540 is a CD4-IgG heteroconjugate that has completed a phase II clinical trail 
sponsored by Progenics,”! while TNX-355 is a humanized IgG4 anti-CD4 monoclonal antibody that 
is in phase II clinical studies.” 

The ACE2 protein is a potential target for therapeutic intervention but given the importance of 
this protein in cardiovascular function and its other unknown physiological roles, there is the 
potential for mechanism-based toxicities. Potent and selective small molecule inhibitors of ACE2 
have been discovered, including MLN-4760 (Fig. 3c), which is a picomolar inhibitor of ACE2 with 
>5,000 fold selectivity over ACE.’*”* On the other hand, ACE2 is insensitive to the inhibition by 
captopril and related ACE inhibitors.°°°””* The large conformational changes associated with the 
outer edge of the two ACE2 subdomains, likely S1-recognizing regions, upon binding of MLN-4760 
could disrupt the interaction between the S protein and ACE2 and, hence, interfere with viral entry. 
However, this hypothesis remains to be tested. Another small molecule, VE607 (Fig. 4), was 
identified using a phenotype-based screen in which Vero cells were infected with SARS CoV. This 
compound was subsequently evaluated in a Vero cell-based plaque reduction assay as well as 
pseudotype viral S protein/ACE2-mediated entry assay.” VE607 inhibited in both the viral entry and 
plaque reduction assays at micromolar concentrations and did not inhibit either SARS CoV 3CL 
protease or helicase. However, confirmation of the mode of action awaits the selection of resistant 
virus and resistant mutation mapping. 

The observation that SARS CoV does not infect certain cells that abundantly express ACE2 and 
also infects cells devoid of ACE2 expression””° suggests that other receptors or co-receptors are 
required for viral entry. Very recently, CD290L, a C-type lectin, type II transmembrane glycoprotein 
also known as L-SIGN, DC-SIGNR, and DC-SIGN2, was identified as a potential receptor that 
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Figure 4. Structure of VE607 andits anti-SARS CoV activities. 
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interacts with amino acids 1—590 of the S1 domain of the S protein of SARS CoV. However, CD290L 
is much less efficient than ACE2 in promoting SARS CoV entry’ but, importantly, this protein is 
expressed on type II alveolar cells and endothelial cells in human lung. Binding to the CD290L 
homolog, dendritic cell-specific intracellular adhesion molecule 3-grabbing non-integrin (DC- 
SIGN), was also observed using S protein pseudotyped virus.*° Dendritic cells promoted SARS CoV 
infectivity by transporting the virus to target cells but these cells themselves were not susceptible to 
infection. 


5. ANTIGENIC DETERMINANTS AND NEUTRALIZING 
MONOCLONAL ANTIBODIES 


The SARS CoV S protein contains 23 potential N-linked glycosylation sites> and surface 
carbohydrates are important for recognition by neutralizing antibodies.*! Antigenic determinants of 
the S protein that can elicit potent neutralizing antibodies were found to be located in both the S1 and 
S2 domains. The S1 peptide fragment represented by amino acids 441—700 expressed in an E. coli 
system’® and a synthetic peptide comprised of amino acids 603-634” reacted with serum samples 
from convalescent SARS patients, but not with those from healthy donors. A peptide from the ACE2- 
binding domain, amino acids 318-510, conjugated to the Fc domain of human IgG1 (RBD-Fc), 
obtained by expression of 293T cells transfected with the corresponding plasmid, induced potent 
antibody responses in immunized rabbits.°° These ACE-binding domain-directed antibodies 
inhibited the association of $1 to ACE2 and neutralized the infection of SARS CoV of Vero E6 cells 
in vitro.®° In more detailed studies, 27 monoclonal antibodies were isolated from mice immunized 
with RBD-Fc, 25 of which were found to recognize 6 different conformationally-defined epitopes 
while two recognized linear epitopes within the receptor-binding domain, amino acids 435—451 and 
449-458, respectively. Interestingly, only two groups of conformation-dependent antibodies could 
inhibit RDB-Fc binding to soluble ACE2 and ACE2-transfected 293T cells. These antibodies 
potently neutralized SARS CoV pseudovirus infection of ACE2/293T cells with nanomolar ECs s, 
while the two antibodies that recognized linear epitopes exhibited only weak neutralization 
properties.*' These results indicate the antigenic heterogeneity of a receptor-binding domain that 
contains multiple neutralization epitopes. Human monoclonal antibody CR3014, which was 
identified through screening of an antibody phage display library for binding to y-irradiated, whole 
SARS CoV virions, neutralized the infectivity of SARS CoV towards Vero cells with nanomolar 
potency.*” The level of neutralization correlated with the binding affinity to a peptide comprised of 
amino acids 1—565 of the S1 protein. More specifically, CR3014 bound to peptides comprised of 
amino acids 318-510 from nine human SARS CoV isolates. Since there was approximately 50% 
reduction in the binding of CR3014 to the peptide derived from the BJ302cl.2 strain that possessed a 
N479S mutation, it was suggested that this residue was important for the binding of CR3014. These 
results also indicate the importance of evaluating the antiviral spectrum of anti-SARS CoV 
monoclonal antibodies. Human non-immune recombinant monoclonal antibody 80R neutralized 
SARS CoV infection of Vero E6 cells in vitro.®° The 80R epitope was conformationally dependent 
and located between amino acids 324 and 503 of the S protein,** encompassing the ACE2 receptor- 
binding domain, a result consistent with the observation that 830R was competitive with soluble ACE2 
for binding to the SI domain of the S protein. The recorded Kg of 32.3 nM could be enhanced to 
1.59 nM, affinity comparable to the Ky of native ACE2, by conjugation to whole human IgG1. This 
80R-IgG1 variant was 20-fold more effective in neutralizing the SARS CoV infection of Vero E6 
cells, with an ECs 9 of 0.37 nM, which compares to an ECs of 7.43 nM for 80R.% MAb 201 isa fully 
humanized monoclonal antibody that was isolated from transgenic mice with human Ig genes 
immunized with a recombinant ectodomain of the S protein, amino acids 1—1190. This antibody 
neutralized infection Vero E6 cells by SARS CoV at nM concentrations.** MAb 201 bound to full 
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length S protein expressed on transfected HEK-293T/17 cells with a 50% binding at 1 nM, and to a 
soluble fragment of S protein (amino acids 1-590) with a Ky of 34 nM, as measured by Biacore 
technology. The neutralization epitope of MAb 201 was mapped to amino acids 490-510 of the CoV 
S protein, again within the ACE2 receptor-binding domain. Although conformationally-defined 
epitopes are very important for eliciting neutralizing antibodies, as mentioned earlier, studies using 
denatured S protein fragments suggested that MAb 201 may recognize linear epitopes on the 
receptor-binding domain of the S protein.** Two E. coli-expressed peptide fragments of the S protein, 
amino acids 447—455 and 789-797, were recognized by mouse monoclonal antibodies D3D1 and 
D305, respectively.*° These sub-segmenting studies suggested that the nine amino acid sequence 
447-455 of the S protein may constitute the minimal epitope of the receptor-binding domain and 
signify that inhibition of receptor-binding constitutes a key neutralization mechanism for anti-SARS 
CoV vaccines. For example, the major neutralizing epitope of antibodies elicited in mice, rabbits, and 
monkeys by immunization with an anti-SARS vaccine (ADS-MVA), obtained using an attenuated 
vaccinia virus Ankara expressing the full length SARS CoV S protein, was mapped to amino acids 
400-600. This region encompasses the ACE2 receptor-binding domain of S1 and these antibodies 
were shown to bind to RBD-Fc protein.*° Administration of this vaccine to monkeys provided 
protection against a SARS CoV challenge, with no detectable virus shedding on days 4 and 6 post- 
challenge in all the four study animals. Monoclonal antibodies 1A5 and 2C5 were isolated from mice 
immunized with an inactivated SARS CoV, and their epitopes mapped to the amino acid sequence 
310-535 of the S protein. These two antibodies showed much higher neutralizing titers against SARS 
CoV infection of Vero E6 cells than other antibodies that recognize other regions of the S1 domain or 
the S2 domain.*° Vaccination of mice with a DNA plasmid encoding a form of the S protein that 
included the transmembrane domain, elicited humoral, and cellular immune responses.®’ However, it 
was demonstrated that neutralizing antibodies, not T-cells, provided the immune protection against 
pulmonary viral infection in murine challenge studies using SARS CoV. In this experiment, a greater 
than 6 logo reduction in viral load in the lung was observed in treated mice compared to control 
animals. One of the neutralizing epitopes identified in antibodies from rabbits exposed to these DNA 
vaccines was mapped to amino acids 12—535.*° A S1 domain peptide fragment (designated as S-II), 
comprised of amino acids 485-625, was obtained by expression in E. coli. S-II was shown to bind to 
Vero E6 cells and to induce monoclonal antibodies in mice. All of the four monoclonal antibodies 
isolated recognized both the monomeric and the trimeric native forms of SARS CoV S protein, 
and two of these completely inhibited the SARS CoV infection of Vero E6 cells at a concentration of 
10 pg/mL. Interestingly, two of the neutralizing epitopes identified were in the region of amino acids 
548-567 and 607-627 of the S-II peptide,*? both of which are located outside of the receptor-binding 
domain. 

The amino acid sequence 787-809 of the SARS CoV S protein was identified as one of 
the epitopes recognized by antibodies from convalescent SARS patients.** The S2 domain peptide 
fragment comprised of amino acids 803-828 elicited antibodies in rabbits and mice that have 
neutralizing effects towards a SARS-CoV pseudovirus infection of Huh-7 cells.*? Other studies also 
suggested that epitopes in the C-terminal region of the HR2 domain are susceptible to antibody 
responses. Other neutralizing epitopes to emerge from DNA vaccine-induced antibody generation 
in rabbits were mapped to amino acids 797-1192, also outside the ACE2-binding region.*° 
Antibodies raised in rabbit against an E. coli-expressed peptide comprising amino acid residues 
1055-1192 that encompass the HR2 region of the S2 domain, specifically recognized the full length, 
mature S protein expressed on a cell surface, and were shown to possess strong neutralizing activity 
against a SARS-CoV infection of Vero E6 cells.”° Since the antibodies raised in rabbit against another 
E. coli-expressed peptide based on amino acid residues 1061-1093 did not possess neutralizing 
activity towards a SARS-CoV pseudovirus infection of Huh-7 cells®® and antibodies elicited by the 
peptide fragment of amino acids 798-1055 also did not neutralize SARS-CoV infectivity,”° an 
epitope is likely located in the 100 amino acids residue 1093-1192 of the HR2 region. 
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In addition to antibody epitopes, T-cell epitopes are also located on the S protein. Transgenic 
mice treated with an adenovirus vector-based SARS CoV pseudovirus showed a CD8 T-cell response, 
which was found to be induced by H-2K°-restricted epitopes (amino acids 436—443 and 525-532), 
and H-2K°-restricted epitopes (amino acids 366-374) of the S1 domain.”’ Amino acid sequences 
978-986 and 1203-1211 were identified as HLA-A2-restricted epitopes.°” Synthetic peptides 
derived from these two sequences induced a high frequency of IFN-y-secreting T-cell responses in the 
PBMCs of HLA-A2-+ convalescent SARS patients. A nine amino acid peptide within the S2 domain, 
amino acids 1167-1175 (RLNEVAKNL), elicited-specific HLA-A*0201-restricted CD8 cytotoxic 
T lymphocytes both in peripheral blood lymphocyte preparations and transgenic mice.”* 

The identification of the antigenic determinants of the S protein has implications for vaccine 
design and development, including avoiding antibody-enhanced viral infectivity. Indeed, the 
potential for escalation of viral infection by SARS vaccines is currently a major clinical concern. For 
example, recent immunization studies conducted in ferrets with a candidate SARS CoV vaccine 
based on vaccinia virus showed signs of worsening the liver inflammation induced by SARS CoV 
infection.” The strong neutralizing immune response induced by peptide fragments of the receptor- 
binding domain described above also point to the sub-unit vaccine approach. The antigenic 
determinant studies presented above also suggest that monoclonal antibodies that target the S protein 
could, in principle, provide yet another effective approach to the inhibition of SARS CoV viral entry. 
Further support for the use of anti-SARS CoV monoclonal antibodies, in addition to the results from 
the DNA vaccine methodology discussed above, come from experimental results in mice where 
antibodies alone can prevent SARS CoV replication in the lung, particularly in the lower respiratory 
tract.°* These results correlate with the neutralizing effects observed with SARS CoV S pseudotype 
assays in vitro.“' Moreover, S protein-specific neutralizing monoclonal antibodies were isolated from 
a patient who recovered from a SARS infection, among which $3.1 (IgG1 « isotype) was shown to 
protect against SARS CoV replication in the respiratory tract in a mouse model.”° More encouraging 
results came from three animal studies with CR3014, 80R-IgG1, and MAb 201. Ferrets administered 
10 mg/kg of CR3014 had a 3.3 logj9 lower viral load in the lung and better lung pathology ina SARS 
CoV challenge study.”° Administration of 2.5 mg/kg of 80R-IgG1 to mice | day prior toa SARS CoV 
challenge produced a 4 logjg drop in viral load in the lower respiratory tract compared to control 
animals, whereas using a 12.5 mg/kg dose reduced the viral load to below the detection limit.°? MAb 
201 protected mice from infection by live SARS CoV, demonstrating a greater than 2 log; reduction 
in viral load in lung tissue at a dose of 1.6 mg/kg, while undetectable levels of virus were observed at 
doses of 8 and 40 mg/kg of MAb 201. Consequently, clinical trials were planned for MAb 201.84 
However, further studies of this and other monoclonal antibodies will be needed in order to evaluate 
the immunosafety, anti-SARS CoV spectrum, and efficacy in additional animal models. Although the 
prophylactic efficacy of anti-SARS CoV monoclonal antibodies in animals has been established, 
future studies will also be required to assess the safety and therapeutic efficacy in treating established 
infections. A very recent report using a pseudotype assay revealed escape from antibody 
neutralization by a SARS CoV virus isolated from a patient in late 2003 (GD03T0013) and from 
two palm civets (SZ3 and SZ16), which are proposed to be the natural animal reservoir for human 
SARS CoV. The antibodies used were competent for neutralizing human strains identified early 
during the SARS outbreak and raise some cause for concern.”’ Interestingly, the S-mediated entry of 
the pseudotype virus derived from the late 2003 example was less sensitive to inhibition by a soluble, 
recombinant human ACE2 ectodomain, while that derived from the two civets was enhanced by 
antibodies through their interactions with the receptor-binding domain. Antibody enhancement of 
viral entry, however, has not been observed with human strains of SARS CoV.”’ A specific study 
using 80R-IgG1, showed that pseudotype SARS CoV derived from GD03T0013, but not SZ3, was 
completely resistant to neutralization as a result of the D480G mutation,® a residue critical for 80R 
binding to the S1 domain, The implications of genetic mutations in the S protein to vaccine 
development and the use of antibody therapeutics remains to be determined. 
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6. CONCLUSION 


A model for the entry of the SARS CoV into host cells based on the studies reported to date can be 
envisaged. Upon binding of the S1 domain of the trimeric S protein to the host cell receptor ACE2, the 
$2 domain, which carries the internal fusion peptide, is exposed, leading to the insertion of the fusion 
peptide into the host cell membrane. The existence of other receptors or co-receptors will require 
further investigation. A conformational rearrangement of the $2 domain at this pre-hairpin state may 
be induced by further interaction between the S1 domain and the receptor, whereupon the three HR2 
regions wrap around the trimeric HR1 coiled coil, bringing the viral and host cell membranes into 
close proximity. The question of whether endocytosis of the virus occurs, and whether or not the 
conformational changes are promoted by low pH or certain unidentified proteases in the endosome 
still need to be resolved.?® However, it is conceivable that multiple entry mechanisms may be 
exploited by the SARS CoV to establish an infection. During the process of hairpin formation by the S 
protein, the lipid bilayer is destabilized, and membrane mixing occurs along a hydrophobic pathway 
guided by the fusion peptide, the viral membrane proximal aromatic region, and the transmembrane 
anchor. This results in the formation of a fusion pore through which the nucleocapsid containing the 
viral RNA enters the host cell. The SARS CoV entry process offers several opportunities for 
therapeutic intervention, including blockade of the binding of the S protein S1 domain to the ACE2 
receptor, inhibition of the HR1-HR2 association, and hairpin formation or neutralization of 
functional epitopes by monoclonal antibodies. Structural information provided by crystallographic 
structures of ACE2, with and without bound inhibitor, the SARS CoV HR1—HR2 core complex, and 
the identification of the antigenic determinants on S protein offer potential to accelerate the design of 
effective therapeutic agents. Certainly, crystallographic structures of a complex of $1 and ACE2”” as 
well as S protein-monoclonal antibody complexes, should they be obtained in the future, will provide 
critical insights. As described, the inhibition of the SARS CoVentry process is feasible, although the 
inhibitors identified to date are only preliminary in nature. The current understanding of the SARS 
CoV entry process, the structural information now available and the emergence of inhibitors together 
form a solid basis for future drug discovery, of importance if the global community is to be adequately 
prepared for another SARS outbreak. 
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